Machine Translation of TV Subtitles for Large Scale Production
Authors
Martin Volk, Rico Sennrich (University of Zürich, Computational Linguistics, CH-8050 Zurich, (volk|sennrich)@cl.uzh.ch); Christian Hardmeier (Fondazione Bruno Kessler, Human Language Technologies, I-38123 Trento); Frida Tidström (University of Stockholm, Datorlingvistik, SE-10691 Stockholm)
Abstract
This paper describes our work on building and employing Statistical Machine Translation systems for TV subtitles in Scandinavia. We have built translation systems for Danish, English, Norwegian and Swedish. They are used in daily subtitle production and translate large volumes. As an example, we report on our evaluation results for three TV genres. We discuss the lessons learned in the system development process, which shed interesting light on the practical use of Machine Translation technology.
Similar resources
Disambiguation of English Contractions for Machine Translation of TV Subtitles
This paper presents a disambiguation method for English apostrophe+s contractions. They occur frequently in subtitles and pose special difficulties for Machine Translation. We propose to disambiguate these contractions in a pre-processing step and show that this leads to improved translation quality.
Automatic Speech Recognition and Hybrid Machine Translation for High-Quality Closed-Captioning and Subtitling for Video Broadcast
We describe a system to rapidly generate high-quality closed captions and subtitles for live broadcasted TV shows, using automated components, namely Automatic Speech Recognition and Machine Translation. The human stays in the loop for quality assurance and optional postediting. We also describe how the system feeds the human edits and corrections back into the different components for improvem...
Finding Alternative Translations in a Large Corpus of Movie Subtitle
OpenSubtitles.org provides a large collection of user contributed subtitles in various languages for movies and TV programs. Subtitle translations are valuable resources for cross-lingual studies and machine translation research. A less explored feature of the collection is the inclusion of alternative translations, which can be very useful for training paraphrase systems or collecting multi-re...
An Analysis of Audiovisual Subtitling Translation Focusing on Wordplays from English into Persian in the Friends TV Series
Translation of humor and transferring its effect is one of the most challenging tasks of a translator due to the cultural clashes between the source language (SL) and the target language (TL). Accordingly, the present study aimed to specify the most frequently applied strategies in terms of Delabastita’s wordplay model used in SL and their translation strategy by Persian translators acc...
Translating DVD subtitles from English-German and English-Japanese using Example-Based Machine Translation
Due to limited budgets and an ever-diminishing time-frame for the production of subtitles for movies released in cinema and DVD, there is a compelling case for a technology-based translation solution for subtitles (O’Hagan, 2003; Carroll, 2004; Gambier, 2005). In this paper we describe how an Example-Based Machine Translation (EBMT) approach to the translation of English DVD subtitles into Germ...